Incorporation of temporal masking effects into Bark spectral distortion measure

Author

  • Bob Novorita
Abstract

This paper extends a promising objective speech distortion measurement method, the Bark Spectral Distance (BSD) measure, with the auditory concepts of forward and backward temporal masking to improve its measurement accuracy. The results of this investigation show that automatic BSD-based speech quality ratings can be made to correlate better with existing MOS ratings by removing perceptually irrelevant regions of speech from the distance measure. The correlation between the objective BSD measure and the subjective MOS measure increases from 0.91 to 0.98. The best results were found with a window duration of 128 samples, exponential-slope filter characteristics for both forward and backward masking effects, forward masking delays of up to 100 msec, and a backward masking time advance of 40 msec.
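
The idea of dropping temporally masked frames from the distance can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract gives only the window length, the masking spans, and the exponential shape. The function names, the decay constants `fwd_tau_ms` and `bwd_tau_ms`, the 16 msec frame duration (128 samples at an assumed 8 kHz sampling rate), the whole-frame masking decision, and the normalized squared-difference form of the distance are all assumptions made for illustration.

```python
import math

def masking_threshold(frame_energy, frame_ms, fwd_ms=100.0, bwd_ms=40.0,
                      fwd_tau_ms=30.0, bwd_tau_ms=10.0):
    """Per-frame masking threshold from exponentially decayed neighbours.

    fwd_ms / bwd_ms follow the spans quoted in the abstract; the decay
    constants fwd_tau_ms / bwd_tau_ms are hypothetical values.
    """
    n = len(frame_energy)
    fwd_span = int(fwd_ms // frame_ms)   # earlier frames that can mask
    bwd_span = int(bwd_ms // frame_ms)   # later frames that can mask
    thresholds = []
    for i in range(n):
        t = 0.0
        # Forward masking: a loud earlier frame masks the current one.
        for k in range(1, fwd_span + 1):
            if i - k >= 0:
                t = max(t, frame_energy[i - k] * math.exp(-k * frame_ms / fwd_tau_ms))
        # Backward masking: a loud later frame masks the current one.
        for k in range(1, bwd_span + 1):
            if i + k < n:
                t = max(t, frame_energy[i + k] * math.exp(-k * frame_ms / bwd_tau_ms))
        thresholds.append(t)
    return thresholds

def masked_bsd(ref, deg, frame_ms=16.0, **mask_params):
    """Distance between per-frame Bark spectra, skipping masked frames.

    ref, deg: lists of frames, each frame a list of Bark-band values.
    Assumed form: summed squared differences over the kept frames,
    normalized by the reference energy of those frames.
    """
    energy = [sum(frame) for frame in ref]
    thr = masking_threshold(energy, frame_ms, **mask_params)
    num = den = 0.0
    for r, d, e, t in zip(ref, deg, energy, thr):
        if e < t:
            continue  # frame lies below the masking threshold: ignore it
        num += sum((a - b) ** 2 for a, b in zip(r, d))
        den += sum(a * a for a in r)
    return num / den if den > 0 else 0.0
```

Calling `masked_bsd(ref, deg, fwd_ms=0.0, bwd_ms=0.0)` disables both masking spans, which gives an unmasked baseline to compare against.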

Similar resources

Performance of the modified Bark spectral distortion as an objective speech quality measure

The Modified Bark Spectral Distortion (MBSD), used for an objective speech quality measure, was presented previously [1]. The MBSD measure takes into account the noise masking threshold in order to use only audible distortions in the calculation of the distortion measure. Preliminary simulation results have shown improvement of the MBSD over the conventional BSD. In this paper, performance of t...

Enhanced Itakura measure incorporating masking properties of human auditory system

A new enhanced Itakura (E-Itakura) speech distortion measure is proposed in this paper. It incorporates masking properties of the human auditory system into the original Itakura measure. Inaudible noise components masked by speech signals are excluded from the calculation of the E-Itakura measure, while the intrinsic advantage of the Itakura measure is retained. The proposed new measure has bee...

Improvement of MBSD by scaling noise masking threshold and correlation analysis with MOS difference instead of MOS

The Modified Bark Spectral Distortion (MBSD), used for an objective speech quality measure, was presented previously [1][2]. The MBSD measure estimates speech distortion in the loudness domain taking into account the noise masking threshold in order to include only audible distortions in the calculation of the distortion measure. Preliminary simulation results have shown improvement of the MBSD...

Comparison of two objective speech quality measures: MBSD and ITU-T Recommendation P.861

The Modified Bark Spectral Distortion (MBSD), used for an objective speech quality measure, was presented previously [1, 2]. The MBSD measure estimates speech distortion in loudness domain taking into account the noise masking threshold in order to include only audible distortions in the calculation of the distortion measure. Preliminary simulation results have shown improvement of the MBSD ove...

Comparative study of several distortion measures for speech recognition

In this study we compared several different spectral distortion measures including the Itakura-Saito (IS), the log likelihood ratio (LLR), the likelihood ratio (LR), the cepstral (CEP), and two perceptually based distortion measures, the weighted likelihood ratio (WLR) and the weighted slope metric (WSM) distortion measures, in terms of their effects on the performance of a standard dynamic tim...

Publication date: 1999